A Simple and Efficient Skew Detection Algorithm via Text Row Algorithm
نویسنده
چکیده
document image processing, skew detection An important part of any document recognition system is detection of skew in the image of a page. This paper presents a new, accurate and robust skew detection algorithm based on a method for finding rows of text in page images. Results of a test of the new algorithm and a comparison against Baird's well known algorithm on 400 pages show the new algorithm to be more accurate, robust and somewhat faster. In particular the new algorithm only breaks down at skew angles in excess of 15 degrees, compared to the almost uniform distribution of breakdowns ofBaird's algorithm.
منابع مشابه
A Family of Skew-Slash Distributions and Estimation of its Parameters via an EM Algorithm
Abstract. In this paper, a family of skew-slash distributions is defined and investigated. We define the new family by the scale mixture of a skew-elliptically distributed random variable with the power of a uniform random variable. This family of distributions contains slash-elliptical and skew-slash distributions. We obtain the moments and some distributional properties of the new family of d...
متن کاملSkew Detection Technique for Various Scripts
This paper includes the information about the technique used to detect Skew which are introduced during the scanning of the documents. It also discusses about the tool which have been used to implement the technique. The algorithm has been implemented on various scripts. The method provides a very efficient way to calculate the Skew. Correction in the skewed scanned document image is very impor...
متن کاملAn Improved Flower Pollination Algorithm with AdaBoost Algorithm for Feature Selection in Text Documents Classification
In recent years, production of text documents has seen an exponential growth, which is the reason why their proper classification seems necessary for better access. One of the main problems of classifying text documents is working in high-dimensional feature space. Feature Selection (FS) is one of the ways to reduce the number of text attributes. So, working with a great bulk of the feature spa...
متن کاملAn Improved Flower Pollination Algorithm with AdaBoost Algorithm for Feature Selection in Text Documents Classification
In recent years, production of text documents has seen an exponential growth, which is the reason why their proper classification seems necessary for better access. One of the main problems of classifying text documents is working in high-dimensional feature space. Feature Selection (FS) is one of the ways to reduce the number of text attributes. So, working with a great bulk of the feature spa...
متن کاملText Skew Angle Detection in Vision-Based Scanning of Nutrition Labels
An algorithm is presented for text skew angle detection in vision-based scanning of nutrition labels on grocery packages. The algorithm takes a nutrition label image and applies several iterations of the 2D Haar Wavelet Transform (2D HWT) to downsample the image and to compute the horizontal, vertical, and diagonal change matrices. The values of these matrices are binarized and combined into a ...
متن کامل